Efficient Beam Thresholding for Statistical Machine Translation

نویسندگان

  • Deyi Xiong
  • Min Zhang
  • Haizhou Li
چکیده

Beam thresholding is a widely-used pruning approach in decoding algorithms of statistical machine translation. In this paper, we propose two variations on the conventional beam thresholding, both of which speed up the decoding without degrading BLEU score. The first variation is the dynamic beam thresholding, in which the beam threshold varies with the length of source sequences covered by hypotheses. The second one incorporates a language model look-ahead probability into the beam thresholding so that the interaction between a hypothesis and the contexts outside the hypothesis can be captured. Both thresholding methods achieve significant speed improvements when used separately. By combining them together, we obtain a further speedup, which is comparable to that of the cube pruning approach (Chiang, 2007). Experiments also display that the dynamic beam thresholding can further improve the cube pruning.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient A* Search Algorithm for Statistical Machine Translation

In this paper, we describe an efficient A* search algorithm for statistical machine translation. In contrary to beamsearch or greedy approaches it is possible to guarantee the avoidance of search errors with A*. We develop various sophisticated admissible and almost admissible heuristic functions. Especially our newly developped method to perform a multi-pass A* search with an iteratively impro...

متن کامل

The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language

Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...

متن کامل

Word Reordering and a Dynamic Programming Beam Search Algorithm for Statistical Machine Translation

In this article, we describe an efficient beam search algorithm for statistical machine translation based on dynamic programming (DP). The search algorithm uses the translation model presented in Brown et al. (1993). Starting from a DP-based solution to the traveling-salesman problem, we present a novel technique to restrict the possible word reorderings between source and target language in or...

متن کامل

Incremental Decoding for Phrase-Based Statistical Machine Translation

In this paper we focus on the incremental decoding for a statistical phrase-based machine translation system. In incremental decoding, translations are generated incrementally for every word typed by a user, instead of waiting for the entire sentence as input. We introduce a novel modification to the beam-search decoding algorithm for phrase-based MT to address this issue, aimed at efficient co...

متن کامل

Statistical Post-Editing for a Statistical MT System

Statistical post-editing (SPE) techniques have been successfully applied to the output of Rule Based MT (RBMT) systems. In this paper we investigate the impact of SPE on a standard Phrase-Based Statistical Machine Translation (PB-SMT) system, using PB-SMT both for the first-stage MT and the second stage SPE system. Our results show that, while a naive approach to using SPE in a PB-SMT pipeline ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009